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A non-empirical minimal atomic multipole expansion (MAME) defines atomic charges or higher 
multipoles that reproduce electrostatic potential outside molecules. MAME eliminates problems 
associated with redundancy and with statistical sampling, and produces atomic multipoles in line 
with chemical intuition. 



The problem of representing the electrostatic poten- 
tial outside a molecule using atomic charges or higher 
atomic multipoles is very important for understanding 
intermolecular forces. Atomic partial charges, an impor- 
tant part of chemical intuition, are defined in many differ- 
ent ways for different purposes. Chemically-derived (CD) 
charges, such as MuUiken-'^ or Lowdin^, often describe 
molecular fields poorly."^ More recent schemes partition 
molecular density into atomic regions, which may or may 
not overlap.** Similar approaches have been developed for 
solids.^ Most attractive for our purposes are potential- 
derived (PD) charges, which avoid representation of the 
density by producing the 'best' fit to the molecular poten- 
tial directly.^'^ Atomic dipoles and quadrupoles^ are of- 
ten used to increase accuracy in solvation problems^ and 
force field calculations.^ Induced atomic dipoles appear 
naturally in electronic polarization of molecular solids**^ 
to account for the small part of molecular polarization 
that is due to the deformation of atomic orbitals and is 
not captured by redistribution of charges. 

Computational schemes for PD multipoles such as 
Merz-KoUman (MK)," CHelpi^, or CHelpG" differ 
mainly in the sampling domain and the resulting atomic 
charges are strongly method-dependent.^^ Worse still, 
PD methods often yield atomic charges that are counter- 
intuitive, such as negative charges on hydrogens in 
alkanes.*^ Higher multipoles only increase the redundan- 
cies inherent in distributed multipole analysis, improving 
on the accuracy of the field at the expense of instability in 
the multipole values. The severity of the problem can be 
somewhat reduced with SVD techniques,*^'*'' or by intro- 
ducing restraints.*'' Our approach does not use sampling 
and eliminates redundancies before they appear. 

We approximate the true molecular potential, 4>{v), as 
a sum of multipoles of strength qk centered at nuclear 
positions r^, 

0(r) ~ 0approx(r) = X! X! ^k<t>k{-r - (1) 

i k 

where (/)/c(r) is the potential due to the fcth multipole of 
unit strength: </'fc(r) = 1/r for charge, (n • r)/r^ for a 
dipole in the direction n, and so on. Since V^^approx = 
everywhere except at r^, but = 47r/9(r), the atomic 
multipole expansion can only be accurate in regions 
where p(r) « 0. Furthermore, on any closed surface 



S on which p = 0, determines (j)(r) everywhere outside 
S. We therefore choose S to be an isodensity surface, 
p(r) — /, where / is sufficiently small to ensure negli- 
gible charge beyond S, but with sufficient potential on 
S for a determining fit (Fig. 1). We chose 0approx to 
minimize 

^t' = 5-* £ dS [0approx(r) - 4>{r)f (2) 
over S which leads to a system of linear equations 

J2k CrakQk = bm, whcrC 

Crnk = £ dS 0™(r)0fc(r), (3a) 

b„, = S-^ £dS cl>.rn{r)cl>{r). (3b) 

Atomic multipoles defined in this way are fully rotation- 
ally invariant, which is an issue with some PD schemes.*'^ 
The error a can be compared to 0, 

'f ^ j dS (j)'^{Y). (4) 




Fig. 1 Electrostatic potential over isodensity surface S of 
pentane. Here / = 5 x 10"* au. produces 5 at ~ 1.4 A 
from the hydrogens and leaves — 0.2e charge outside. Missing 
charge is negligible for / = 10"*, with S at 1.8 A. 

The crucial issue remaining is the choice of a set of 
multipoles. We choose a minimal set, usually one scalar 
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value per atom, and add additional niultipoles to describe 
lone pairs when necessary, based on the Lewis structure. 
This carefully chosen minimal atomic multipole expan- 
sion (MAME) set avoids redundancies but is within ~ 1 
mH everywhere beyond S. 

We illustrate MAME with three molecules: n-pentanc, 
which is a classic example of difficulties encountered in 
PD schemes; glycine (standard and zwitterion), as a typ- 
ical application in biochemistry; and water, to sec how 
general MAME rules apply to a small polar molecule. All 
densities and potentials are produced on a cubic mesh by 
the Gaussian 98 program, -'^^ at the B3LYP/aug-cc-pVTZ 
level (6-311-1— l-G** for pentane). Surface integrals (3) 
are computed by triangulation of S. The program runs 
within a few seconds, and is available on request. 

Figure 1 shows (/)(r) on S for n-pentane. Red spots 
((/) > 0) show an excess of positive charge near each hy- 
drogen, but all PD schemes tested yield some or all hy- 
drogens negative. Closer inspection of Fig. 1 reveals that 
the positive regions occupy less solid angle around hydro- 
gens than would be produced by a positive charge. Such 
a potential is consistent with a dipole with a negative 
charge pointing inwards. 

Our first rule is therefore to assign a charge to all nuclei 
but protons, to which we assign a dipole moment instead. 
The hydrogen atom is special as its sole electron partic- 
ipates in the bond, leaving no electron density centered 
on the proton. This unique property of hydrogens is well- 
known in X-ray structure analysis, which systematically 
underestimates the C — H bond lengths for this reason. 

Table 1 Partial atomic charges in n-pentane. / = 5 x 10~* 
au., error a as in (2), (%)= (o"/<^), £0 ~ 3.6 mHartree 
(ImH = 27meV ~ kT at 300K). "/j," indicates atomic dipoles, 
"fir" — dipoles restricted along H — C bonds. 



Method q(H), range | q(C), range |e(T, mH (%) 

CD charges 



MuUiken +0.11.. + 


0.14 


-0 


59. 


-0 


11 


9 


3 


(260) 


ZINDO -1-0.03.. + 


0.04 


-0 


15. 


-0 


04 


3 


5 


(99) 


PD charges 


















CHelp -0.04.. + 


0.04 


-0 


11. 


+ 


15 


3 


5 


(97) 


CHelpG -0.04.. + 


0.04 


-0 


16. 


+ 


16 


3 


2 


(87) 


MK -0.03.. + 


0.06 


-0 


22. 


+ 


13 


3 


1 


(86) 


PD charges plus 


dipoles 












CHelp +/X -0.76.. + 


0.10 


-0 


66. 


+ 2 


05 


2 


8 


(78) 


CHelpG +n -0.32.. - 


0.30 


+0 


65. 


+ 


86 


1 


8 


(51) 


MK +IJ, -0.27.. - 


0.20 


+0 


50. 


+ 


64 


1 


8 


(49) 


MAME 


















charges —0.01.. -I- 


0.09 


-0 


34. 


+ 


13 


2 


6 


(72) 


Ax(H) = 0.07. 


0.09) 


-0 


01. 


+ 


03 





5 


(15) 


/ir(H) Ifi = 0.06. 


0.09) 


-0 


02. 


+ 


01 


1 


6 


(45) 



MuUiken charges arc intuitively meaningful but pro- 
duce large errors in the potential (Table 1). PD charges 
are negative on some hydrogens and still give signifi- 
cant errors. Adding dipoles reduces the potential error, 
but at the cost of producing meaningless multipoles.^^ 
Our scheme with charges on all atoms produces similar 



(though better) results, but we do far better (line 2) when 
the charges on hydrogens are replaced with dipoles. All 
dipoles come out similar in magnitude (numbers in brack- 
ets, in au.) and point toward C within 20° of the H — C 
bond. The hydrogen dipoles can be safely restricted to 
lie along the H C bonds (last line) with the accuracy 
still better than that of charges. All multipoles have rea- 
sonable values, including small charges on carbons. Note 
that wc have now described the field outside the molecule 
more accurately than any existing scheme, with only one 
parameter per nuclues (a charge on each carbon and a 
bond-directed dipole on each hydrogen). 

The same choice of multipoles yields a 1.05 mH error 
(=2%) in the glycine zwitterion, (NH3)+-CH2-COO-, 
down from 4% with charges alone and 4% — 6% with stan- 
dard PD schemes. The glycine zwitterion is highly polar 
with dipole = 10.3 D, which MAME recovers within 
0.1% accuracy. 



Table 2 MAME for glycine without and with lone pair 
multipoles. / = 10~* au., e(f> — 15 mH. 





NH2 CH2 c= 


=0 


-OH 


ecr, mH (%) 


Mr(H) 


-.05 -.03 -I-.79 


-.54 


-.17 


4.1 (27) 


/x.(H) 


+ .06 -l-.ll -I-.37 


-.67 


+ .13 




+/ir(N.O) 


-.65 


+ .19 


-.71 


1.6 (11) 


+ 0r(O) 




-..5:-! 


-.87 





Table 2 lists MAME results for glycine in its standard 
form, NH2-CH2-COOH, and illustrates the need for spe- 
cial treatment of lone pairs. In the zwitterion, the NH3 
group is well-described by a charge on N and three dipoles 

on hydrogens, similar to methyls in pentane. The NH2 
group in glycine lacks one site, but has extra electron 
density associated with the lone pair. We thus assign a 
dipole moment to N, in addition to its charge, restricted 
along the sp^ direction of the lone pair. 

Similarly, each oxygen has two lone pairs. Two dipoles 
for the two lone pairs sum to just one dipole along the 
symmetry axis, leading to only one variational parame- 
ter. The potential of this single dipole, however, is axially 
symmetric, whereas the potential around the oxygen de- 
viates from axial symmetry due to the particular orienta- 
tion of the lone pairs. Such a deviation can be accounted 
for with a quadrupole moment on the oxygen. The finite 
system of charges sketched in the inset in Fig. 2 shows 
what is needed. Computing a multipole expansion of 
three charges we describe two lone pairs with two scalar 
parameters, a dipole /itr restricted along the symmetry 
axis, and a restricted quadrupole 6r which has angle f3 as 
a fixed parameter (/3 = 120° for 0= and 109.7° for 0-, 
due to sp^ and sp^ hybridization respectively). 0r and 
/ir are chosen negative with "— " pointing outside. Table 
2 shows a clear advantage of such a multipole set. 

To make the definition more transparent, 6r can 

be expressed in the conventional form as a carefully 
crafted combination of 6zz = 6*^(3 cos^ /3/2 — 1), 9yy = 
6r{3sm^ (3/2 - 1), 0^^ = -Or, and 6xy = dxz = Oyz = 0, 
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which depend on a single scalar parameter 0r. Here z is 
along the symmetry axis, and the lone pairs are in the 
yz-plane. A single restricted quadrupole of strength Or 
creates the potential 

<Pquad[r)=0r ^ , (5) 

where ni and n2 are the directions of the lone pairs. 

For the zwitterion, MAME does not require 9r on the 
oxygens, because of the resonance. Lone pairs in the sp^ 
and sp'^ configurations lie in perpendicular planes, virtu- 
ally destroying any asymmetry. 

MAME accuracy improves away from the molecule 
(Fig. 2). If / is too large (10""^), there is a net charge 
inside S which strongly affects the asymptotic behavior. 
This can be repaired by fixing the total charge using a 
Lagrange multiplier. Figure 2 demonstrates MAME's 
insensitivity to choice of /, provided the total charge is 
correct. 
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Fig. 2 MAME accuracy away from glycine. Ratio of 
(f/iapprox — 0) to (j), both square-averaged over points with a 
given density p, which is an inverse measure of distance. The 
inset illustrates two lone pairs on oxygen represented with a 
combination of restricted dipole and quadrupole Or. 

Last, we analyze the water molecule. The first rule 
leads to two dipoles on hydrogens pointing along the 
bonds, sketched in Fig. 3. The charge on oxygen is 
zero because the molecule is neutral. The dipoles are 
equal due to symmetry and require no calculation since 
their vector sum must yield the dipole moment of water, 
I.847D (B3LYP/aug-cc-pVTZ value). This already re- 
duces the error to 21%, from 45% with a single dipole on 
oxygen. The two distributed dipoles yield Qxx — Qyy = 
4.06 dA for the quadrupole moment of water, whereas 
experiments^ gives 5.12 dA. We note that O^x — Qyy 
is the only invariant combination of quadrupole compo- 
nents, since the finite dipole makes them dependent on 
the center of coordinates. 



We next add fir and Or multipoles on the oxygen to 
describe the correction due to lone pairs, and we find ex- 
cellent accuracy ea = 0.59 mH (< 3%) on and beyond 
S (I.56A from H and 2.1lA from O, / = 10"''). The 
sp^ choice oi P — 109.5° in Or is crucial: accuracy dete- 
riorates dramatically (to 12%) when /3 is changed to e.g. 
180° {6r replaced with 6yy). The sp^ description of the 
oxygen lone pairs is appropriate due to invariance under 
unitary rotations of occupied orbitals. 




Fig. 3 Schematic MAME representation of water. 



In conclusion, molecular fields are represented to chem- 
ical accuracy with a minimal set of atomic multipoles 
carefully chosen based on the Lewis structure of the 
molecule. All H atoms are represented as dipoles. Lone 
pairs are treated with extra multipoles, avoiding addi- 
tional off-nuclear expansion sites. The scheme yields 
multipole values that conform to chemical intuition, are 
unique, fully rotationally-invariant and free of sampling 
errors. 
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